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Measurements are presented of the jet invariant mass and substructure in proton-proton collisions 
at yfs — 7 TeV with the ATLAS detector using an integrated luminosity of 37 pb _1 . These results 
exercise the tools for distinguishing the signatures of new boosted massive particles in the hadronic 
final state. Two "fat" jet algorithms are used, along with the filtering jet grooming technique that 
was pioneered in ATLAS. New jet substructure observables are compared for the first time to data 
at the LHC. Finally, a sample of candidate boosted top quark events collected in the 2010 data is 
analyzed in detail for the jet substructure properties of hadronic "top-jets" in the final state. These 
measurements demonstrate not only our excellent understanding of QCD in a new energy regime 
but open the path to using complex jet substructure observables in the search for new physics. 



I. INTRODUCTION 

The hadronic final state may be explored in terms of the structure and shape of the hadronic energy flow via 
the internal hard and soft structure of jets - jet "substructure." Such an approach seeks to measure important 
aspects of QCD and to identify patterns and topological features that are potentially indicative of the presence 
of new physics. 

Hadronic decays of heavy particles such as W bosons [1-3], top quarks [4-8], the Higgs boson [9, 10], and 
potential new particles [11], may be collimated into a single heavy jet characterized by distinct substructure and 
large mass. In this analysis, two "fat" jet algorithms, along with the filtering jet grooming technique developed 
in the search for a boosted Higgs boson, are used to measure the jet mass and substructure. Results are presented 
at the particle level and compared to three Monte Carlo simulation programs. A first measurement of the jet 
mass scale uncertainty in ATLAS is made, and consequences for searches based on jet mass are discussed. An 
additional jet substructure observable, the first kt splitting scale or \J di2, is measured for the first time at the 
LHC. Finally, a sample of candidate boosted top quark events collected in the 2010 data is analyzed in detail 
for the jet substructure properties of hadronic "top-jets" in the final state. 

II. DATA SELECTION AND MONTE CARLO SAMPLES 

The ATLAS detector [12] provides nearly full solid angle coverage of the collision point with tracking detectors, 
calorimeters and muon chambers. Of these multiple subsystems the most relevant to the analyses presented 
here are the inner detector [13], barrel and endcap calorimeters [14, 15] and trigger [40]. 

The entire 2010 dataset is used for the analysis of jet substructure, corresponding to an integrated luminosity 
of approximately 37±1.3 pb _1 . The uncertainty of 3.4% on the luminosity is obtained from the latest luminosity 
calibration using the full set of van der Meer scans performed in 2010 [16]. 

A sample of high-px jets is selected via a single jet trigger with a nominal transverse energy threshold of 
95 GeV at the electromagnetic scale. The offline selection requires that the highest pt jet be within \rj\ < 2.0 and 
to have p^ > 300 GeV. Events are required to contain exactly one primary vertex in order to reduce the impact of 
mulitplc simultanous proton-proton interactions, although a dedicated study demonstrating the impact of such 
pile-up is also presented. The anti-/c t jet algorithm [17, 18] used for most of the results presented yields regularly 
shaped jets that are well-suited for calibration. The Cambridge- Aachen (C/A) jet algorithm [19, 20] is also 
very appropriate for measurements of jet substructure in two-body decays such as that expected from a boosted 
Higgs decaying to two b quarks. Jet radius parameters of R = 1.0 and R = 1.2 are used for the anti-fct and 
C/A algorithms, respectively. Jet quality criteria such as the timing of calorimeters cells, the electromagnetic 
energy fraction, and pulse shape information are used to select only those events with well measured jets. These 
criteria are designed to remove events that are likely to have contamination due to beam-related backgrounds or 
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detector defects. Since the primary vertex multiplicity is different between data and MC due to event pile-up, 
a re-weighting factor is applied to the MC prior to event selection. 

The data are compared to events simulated using two different approaches to MC event generation. The 
first uses direct perturbative calculation of the cross-section matrix elements in powers of the strong coupling 
constant, as- For QCD jet production, this approach is used by ALPGEN [21] and performed at leading-order 
(LO) in as for each relevant partonic subprocess. The second approach implements a sampling of the phase- 
space available for gluon emission with some suitable approximations. The simulation programs PYTHIA [22] 
and HERWIG++ [23] both implement this approach for QCD jet production and use LO perturbative calculations 
of matrix elements for 2 — > 2 processes and rely on the parton shower to produce the equivalent of multi-parton 
final states. In the analysis presented here, we utilize ALPGEN 2.13 with exact matrix element calculations up 
to n < 6 partons and interfaced to HERWIG 6.510 [24] to provide the parton shower and hadronization model 
and with JIMMY 4.31 [25] for the underlying event model. Comparisons are also made to PYTHIA 6.423 and 
HERWIG++ 2.4 which provide shower models that are p\ ordered and angular ordered, respectively. Leading- 
order parton density functions are taken from the MRST2007 LO* [26, 27] set, unless stated otherwise. 

The MC generated samples are passed through a full simulation of the ATLAS detector and trigger [28] based 
on GEANT4 [29]. The Quark Gluon String (QGSP) model [30] is used for the fragmentation of the nucleus, and 
the Bertini cascade (BERT) model [31] for the description of the interactions of the hadrons in the medium of 
the nucleus. Alternative GEANT4 physics lists, using a combination of the FRITIOF [32] and Bertini models 
and QGSP without Bertini are used as part of the studies to understand the uncertainties on the jet energy 
and mass scale. 



The jet mass and internal splitting scales are measured for a sample of inclusive jets with p0£ > 300 GeV [33]. 
The two jet algorithms, anti-fct and C/A, utilized for these measurements use wide characteristic radii (R — 1.0 
and R — 1.2, respectively) as this improves efficiency in the case of searches for new hadronically decaying 
heavy boosted objects. Furthermore, the jet filtering procedure introduced in Refs. [9, 10] is applied to the C/A 
algorithm and the resulting "groomed" jet mass is measured. 

Whereas the jet mass is the four-momentum sum of the clustered jet constituents, the k± splitting scale is 
defined by first reclustering the constituents of the jet with the k± algorithm [34, 35]. The final recombination 
step defines the splitting scale variable as 



where i,j represent the two proto-jets combined at the final step of the k t algorithm. Due to the definition of 
the distance metric for k t , the last recombination will often represent the two most widely separated and highest 
pxjet constituents. Consequently, for a two-body heavy particle decay the final clustering step will usually be 
to combine those two decay products. The parameter \fd~\i can therefore be used to distinguish heavy particle 
decays, which tend to be reasonably symmetric, from largely asymmetric QCD splittings. The expected value 
for a heavy particle decay is approximately m/2 whereas plain QCD jets will tend to have values < 20 GeV 

with a long continuous tail to high \J d\%. 

For C/A jets, the filtering jet grooming procedure aims to identify splittings in a jet which each contribute 
significantly to the jet invariant mass. The procedure is described in detail in Ref. [33]. At each step during the 
C/A clustering the masses of the proto-jets to be combined are recorded. This clustering history is then searched 
in reverse for a step at which the jet mass falls by approximately one-third while the px of the proto-jets is 
roughly symmetric. The jet is then reclustered with a smaller jet radius (typically R — 0.3), and if more than 
two subjets are found the three highest pt subjets are retained and used to calculate the final jet kinematics. 

The jet mass and splitting scale are corrected for detector detector effects via a bin-by-bin unfolding procedure. 
The final bin sizes are chosen such that bin migrations are small and the per-bin purity [41] is greater than 50%. 

Fully corrected hadron-level distributions of the jet mass and splitting scale are shown in Figures 1-2. Both 
the unfiltered C/A and filtered C/A jet mass distributions are shown in Figure 1. HERWIG+- 1- predicts slightly 
more massive jets than supported by the data whereas PYTHIA and HERWIG / JIMMY yield measurements which 
bracket the data and agree to within systematic uncertainties in all bins. Very notably, the differences between 
the mass distributions predicted by the various MC programs are greatly reduced after the filtering procedure 
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FIG. 1: Invariant mass spectrum of Cambridge-Aachen jets with pr > 300 GeV and \y\ < 2 (a) before and (b) after the 
splitting and filtering procedure has been applied. Both distributions are fully corrected for detector effects, systematic 
uncertainties are depicted by the shaded band. 



as shown in Figure 1(a). This observations suggests that the jet mass after the filtering procedure accurately 
represents the true hard components within the jets which are modeled well by the simulation. 

Figure 2 depicts both the fully corrected mass distribution and the k t splitting scale distribution for anti-/ct, 
R = 1.0 jets. A similar conclusion may be drawn in this case as in the C/A jets, wherein HERWIG++ tends to 
predict slightly more massive jets on average than observed in the data. However, in this case, the high mass 
tail of the distribution is actually better reflected in the HERWIG / JIMMY and HERWIG++ MC simulations than 
in PYTHIA. 

Figure 3 exemplifies the potential for adverse pile-up effects and the power of grooming techniques such as 
jet filtering. The increase in the mean jet invariant mass of calorimeter jets is measured as a function of the 
number of additional interactions in the event, Npy. Figure 3(a) highlights the dependence of this increase 
on the radius of the jet algorithm, growing linearly with R in the case of zero pile-up, whereas the slope as a 
function of pile-up shows an R 3 dependence. This behavior can be qualitatively understood by the growth in jet 
area as R 2 , and the contribution of additional particles to the jet mass scales with the distance between them 
~ R/2 yielding another power of R. Figure 3(b) shows the rise in the mean jet mass as a function of Npy before 
and after filtering. This procedure restores a flat dependence of the jet mass on Apy to within uncertainties. 



IV. BOOSTED HEAVY PARTICLES 



A handful of candidate boosted top quark events have been observed in the 2010 data and provide a unique 
first opportunity to test the techniques described above in-situ. This sample has been collected using the high 
purity lepton+jets channel, which has a signal-to-background (S/B) ratio on the order 2 to 3 [36]. A subsample 
was then selected to have m t i > 700 GeV. These events offer a fertile testing ground for the substructure 
techniques discussed. 

Figure 4 shows a single ATLAS event display for one of the selected candidate boosted top quark events. In 
this display, anti-fc t jets with R = 0.4 are indicated in red whereas large radius anti-fc t jets with R = 1.0 are 
shown in green in the same figure. The leading R — 0.4 jet in the event corresponds to the leptonic leg of the 
top event and has px = 199 GeV. It is also tagged as a 6-jet with high probability via both a track impact 
parameter-based tagger and via a secondary-vertex + impact parameter tagger. The leading R — 0.4 jet in the 
opposite hemisphere, corresponding to the hadronic leg of the event has pt = 157 GeV and no fo-tag, whereas 
the second leading jet in the hemisphere of the hadronic candidate has a strong tag. When re-clustering the 
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FIG. 2: (a) Invariant mass spectrum of anti-fct jets with pr > 300 GeV and \y\ < 2 and (b) Vfli2 distribution for the 
same jets. Both distributions are fully corrected for detector effects, systematic uncertainties are depicted by the shaded 
band. 
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FIG. 3: The mean mass for jets with pr > 300 GeV as a function of the number of primary vertices (Npy) identified 
in the event. Comparisons show the effect for anti-fet jets with different resolution parameters (a) and C/A, R — 1.2 jets 
with and without splitting and filtering procedure (b). Each set of points is fitted with a straight line. 



event with R — 1.0, the jet corresponding to the hadronic leg has pt = 327 GeV and m? = 206 GeV as 
well as \fd~v2 = 110 GeV and \J — 40 GeV. The value of \J d^z is suggestive of the presence of a boosted 
hadronically decaying W boson reconstructed within this jet. These observations render this event to be a 
strong candidate for a boosted hadronically decaying top quark event. 

Lastly, a sample of events selected to contain a W — > tv candidate consistent with having a W boson 
p% > 200 GeV is analyzed [37]. The jet mass distribution of filtered C/A, R = 1.2 jets with p T > 180 GeV and 



Proceedings of the DPF-2011 Conference, Providence, RI, August 8-13, 2011 



5 




FIG. 4: Leptonic top candidate formed by high pr electron (145 GeV), moderate Z?J" SS , and a 6-tagged jet. When 
reclustered with R = 1.0 the leptonic candidate acquires a large pr, mass and 1 — > 2 splitting scale as it absorbs the 
electron. Three jets are identified with the hadronic top quark. When reclustered with R — 1.0, these jets merge into a 
single jet with mj e t = 197 GeV, Vdi2 = 110 GeV, and Vcfe = 40 GeV. Jets indicated in red correspond to R = 0.4, jets 
in green to R = 1.0. 

A0n/jct > 1.2 in these events is shown in Figure 5. A clear mass peak near the W mass indicates the presence of 
boosted hadronic W's. The three main contributions to these events are ti (generated with MC@NLO +HERWIG 
/JIMMY [38, 39]), W+jets (generated with ALPGEN +HERWIG /JIMMY ), and WW (generated with HERWIG/ JIMMY), 
all normalized to the highest order cross-section available (see Ref. [37] for more details). The good agreement 
between data and the various MC simulations suggests both that the tools described above are well described 
in a complex physics environment and that the systematics are generally well under control. 

V. CONCLUSIONS 

The study of the hadronic final state is explored in terms of the substructure of hadronic jets via measurements 
of the jet mass and k t splitting scales for anti-fc t jets with R = 1.0 and C/A jets with R = 1.2. These 
measurements are corrected for detector effects and a full systematic uncertainty evaluation is performed. In 
addition, the jet mass is measured for C/A jets after a splitting and filtering procedure. In all observables the 
PYTHIA and HERWIG samples are in agreement with data to within the systematic uncertainties. The HERWIG++ 
prediction appears to be slightly disfavoured in the unfiltered C/A mass spectra, producing jets with a higher 
mass than found in data. 

The effect of pile-up on jet mass has also been studied. The expected scaling with number of vertices and jet 
radius is observed. It has also been shown that the filtering procedure applied to C/A jets reduces the impact 
of pile-up on jet mass to the extent that it is undetectable in the 2010 dataset. 

Finally, the applications of these techniques is demonstrated in the tagging of candidate boosted top quarks 
and the measurement of fully hadronic W decays. 

Overall it is clear that ATLAS is capable of delivering measurements of the variables considered in this study 
and that these observables are well modeled by leading order Monte Carlo. Searches for boosted Higgs bosons, 
supersymmetric particles, and top-quark resonances will all benefit from utilizing these techniques. 
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FIG. 5: The jet mass distribution of subjets with pr > 180 GeV in events consistent with a,W^rlv boson decay with 
pr > 200 GeV. The distribution is compared to the uncorrected MC simulation prediction for ti, W+jets and WW 
processes. 
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